Benchmarking Spatial Data Warehouses
نویسندگان
چکیده
Spatial data warehouses (SDW) enable analytical multidimensional queries together with spatial analysis. Mainly, three operations are related to SDW query processing performance: (i) joining large fact tables and large spatial and non-spatial dimension tables; (ii) computing one or more costly spatial predicates based on spatial ad hoc query windows; and (iii) aggregating data according to different spatial granularity levels. Several techniques to improve the query processing performance over SDW have been proposed in the literature. However, we identified the lack of a benchmark to carry out a controlled experimental evaluation of such techniques and, principally, to effectively measure the costs of the aforementioned three complex operations. In this paper, we propose a novel spatial data warehouse benchmark, called Spadawan, to provide performance evaluation environments for SDW and enable a further investigation on spatial data redundancy. The Spadawan benchmark is available at http://gbd.dc.ufscar.br/spadawan.
منابع مشابه
Multidimensional benchmarking in data warehouses
Benchmarking is among the most widely adopted practices in business today. However, to the best of our knowledge, conducting multidimensional benchmarking in data warehouses has not been explored from a technical efficiency perspective. In this paper, we formulate benchmark queries in the context of data warehousing and business intelligence, and develop algorithms to answer benchmark queries e...
متن کاملWarehouse Benchmarking Results: a Comparison of Wholesale and Manufacturing Warehouses
Warehouses are a substantial component of GDP and a significant contributor to speed and cost in supply chains. An analysis of a cross section of warehouse performance data would provide a better understanding of warehouse technical efficiency, the factors contributing to efficiency and the best practices for improving efficiency. This understanding would improve the practice of warehousing, re...
متن کاملSpatial and Spatio-Temporal Multidimensional Data Modelling: A Survey
Data warehouse store and provide access to large volume of historical data supporting the strategic decisions of organisations. Data warehouse is based on a multidimensional model which allow to express user’s needs for supporting the decision making process. Since it is estimated that 80% of data used for decision making has a spatial or location component [1, 2], spatial data have been widely...
متن کاملOn the Requirements for User-Centric Spatial Data Warehousing and SOLAP
Data warehouses and OLAP systems help to analyze complex multidimensional data and provide decision support. With the availability of large amounts of spatial data in recent years, several new models have been proposed to enable the integration of spatial data in data warehouses and to help analyze such data. This is often achieved by a combination of GIS and spatial analysis tools with OLAP an...
متن کاملUser-Centric Spatial DataWarehousing: A Survey of Requirements & Approaches
Data warehouses have traditionally been used to analyse large multidimensional datasets and provide enterprise decision support. With an increased availability of spatial data in recent years, several new strategies have been proposed to enable their integration into data warehouses and OLAP systems and perform complex analysis on them. One strategy to achieve this is to integrate existing Geog...
متن کامل